Overview

Dataset statistics

Number of variables30
Number of observations1615
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory378.6 KiB
Average record size in memory240.1 B

Variable types

NUM26
CAT2
BOOL2

Reproduction

Analysis started2020-07-08 08:46:45.129054
Analysis finished2020-07-08 08:48:24.836283
Duration1 minute and 39.71 seconds
Versionpandas-profiling v2.8.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml

Warnings

VolumeCred_CA is highly correlated with VolumeCredHigh correlation
VolumeCred is highly correlated with VolumeCred_CAHigh correlation
TransactionsCred_CA is highly correlated with TransactionsCredHigh correlation
TransactionsCred is highly correlated with TransactionsCred_CAHigh correlation
VolumeDeb_CA is highly correlated with VolumeDebHigh correlation
VolumeDeb is highly correlated with VolumeDeb_CAHigh correlation
TransactionsDeb_CA is highly correlated with TransactionsDebHigh correlation
TransactionsDeb is highly correlated with TransactionsDeb_CAHigh correlation
VolumeDebCash_Card is highly skewed (γ1 = 20.57539539) Skewed
Client has unique values Unique
Tenure has 19 (1.2%) zeros Zeros
Count_SA has 1189 (73.6%) zeros Zeros
Count_MF has 1309 (81.1%) zeros Zeros
Count_CL has 1480 (91.6%) zeros Zeros
ActBal_CA has 94 (5.8%) zeros Zeros
ActBal_SA has 1205 (74.6%) zeros Zeros
ActBal_MF has 1420 (87.9%) zeros Zeros
ActBal_OVD has 1495 (92.6%) zeros Zeros
ActBal_CC has 1461 (90.5%) zeros Zeros
ActBal_CL has 1481 (91.7%) zeros Zeros
VolumeCred has 42 (2.6%) zeros Zeros
VolumeCred_CA has 58 (3.6%) zeros Zeros
TransactionsCred has 42 (2.6%) zeros Zeros
TransactionsCred_CA has 58 (3.6%) zeros Zeros
VolumeDeb has 94 (5.8%) zeros Zeros
VolumeDeb_CA has 100 (6.2%) zeros Zeros
VolumeDebCash_Card has 659 (40.8%) zeros Zeros
VolumeDebCashless_Card has 733 (45.4%) zeros Zeros
VolumeDeb_PaymentOrder has 463 (28.7%) zeros Zeros
TransactionsDeb has 94 (5.8%) zeros Zeros
TransactionsDeb_CA has 100 (6.2%) zeros Zeros
TransactionsDebCash_Card has 659 (40.8%) zeros Zeros
TransactionsDebCashless_Card has 733 (45.4%) zeros Zeros
TransactionsDeb_PaymentOrder has 463 (28.7%) zeros Zeros

Variables

Client
Real number (ℝ≥0)

UNIQUE

Distinct count1615
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean808.0
Minimum1
Maximum1615
Zeros0
Zeros (%)0.0%
Memory size12.6 KiB

Quantile statistics

Minimum1
5-th percentile81.7
Q1404.5
median808
Q31211.5
95-th percentile1534.3
Maximum1615
Range1614
Interquartile range (IQR)807

Descriptive statistics

Standard deviation466.3546576
Coefficient of variation (CV)0.5771716059
Kurtosis-1.2
Mean808
Median Absolute Deviation (MAD)404
Skewness0
Sum1304920
Variance217486.6667
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
161510.1%
 
108410.1%
 
106410.1%
 
106610.1%
 
106810.1%
 
107010.1%
 
107210.1%
 
107410.1%
 
107610.1%
 
107810.1%
 
Other values (1605)160599.4%
 
ValueCountFrequency (%) 
110.1%
 
210.1%
 
310.1%
 
410.1%
 
510.1%
 
ValueCountFrequency (%) 
161510.1%
 
161410.1%
 
161310.1%
 
161210.1%
 
161110.1%
 

Sex
Categorical

Distinct count3
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size12.6 KiB
0
856
1
756
2
 
3
ValueCountFrequency (%) 
085653.0%
 
175646.8%
 
230.2%
 

Length

Max length1
Median length1
Mean length1
Min length1

Age
Real number (ℝ≥0)

Distinct count94
Unique (%)5.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean42.84891640866873
Minimum0
Maximum97
Zeros1
Zeros (%)0.1%
Memory size12.6 KiB

Quantile statistics

Minimum0
5-th percentile16
Q129
median41
Q357
95-th percentile73
Maximum97
Range97
Interquartile range (IQR)28

Descriptive statistics

Standard deviation18.5505294
Coefficient of variation (CV)0.4329287869
Kurtosis-0.5551439501
Mean42.84891641
Median Absolute Deviation (MAD)14
Skewness0.1792470986
Sum69201
Variance344.122141
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
40412.5%
 
39412.5%
 
26372.3%
 
27372.3%
 
37372.3%
 
32362.2%
 
29342.1%
 
38342.1%
 
30332.0%
 
34332.0%
 
Other values (84)125277.5%
 
ValueCountFrequency (%) 
010.1%
 
160.4%
 
250.3%
 
330.2%
 
490.6%
 
ValueCountFrequency (%) 
9710.1%
 
9410.1%
 
9310.1%
 
9210.1%
 
9010.1%
 

Tenure
Real number (ℝ≥0)

ZEROS

Distinct count248
Unique (%)15.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean101.33993808049536
Minimum0
Maximum273
Zeros19
Zeros (%)1.2%
Memory size12.6 KiB

Quantile statistics

Minimum0
5-th percentile8
Q144
median97
Q3151
95-th percentile205
Maximum273
Range273
Interquartile range (IQR)107

Descriptive statistics

Standard deviation64.91729737
Coefficient of variation (CV)0.6405894715
Kurtosis-0.9853661042
Mean101.3399381
Median Absolute Deviation (MAD)54
Skewness0.1903179728
Sum163664
Variance4214.255498
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
150704.3%
 
151633.9%
 
152493.0%
 
181362.2%
 
149322.0%
 
0191.2%
 
20181.1%
 
33161.0%
 
8161.0%
 
176140.9%
 
Other values (238)128279.4%
 
ValueCountFrequency (%) 
0191.2%
 
170.4%
 
290.6%
 
3100.6%
 
480.5%
 
ValueCountFrequency (%) 
27310.1%
 
27110.1%
 
26810.1%
 
26710.1%
 
26610.1%
 

Count_CA
Categorical

Distinct count4
Unique (%)0.2%
Missing0
Missing (%)0.0%
Memory size12.6 KiB
1
1515
2
 
77
3
 
19
4
 
4
ValueCountFrequency (%) 
1151593.8%
 
2774.8%
 
3191.2%
 
440.2%
 

Length

Max length1
Median length1
Mean length1
Min length1

Count_SA
Real number (ℝ≥0)

ZEROS

Distinct count5
Unique (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.3077399380804954
Minimum0.0
Maximum5.0
Zeros1189
Zeros (%)73.6%
Memory size12.6 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q31
95-th percentile1
Maximum5
Range5
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.5676388905
Coefficient of variation (CV)1.844540862
Kurtosis6.141804467
Mean0.3077399381
Median Absolute Deviation (MAD)0
Skewness2.115775757
Sum497
Variance0.32221391
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0118973.6%
 
136922.8%
 
2452.8%
 
3110.7%
 
510.1%
 
ValueCountFrequency (%) 
0118973.6%
 
136922.8%
 
2452.8%
 
3110.7%
 
510.1%
 
ValueCountFrequency (%) 
510.1%
 
3110.7%
 
2452.8%
 
136922.8%
 
0118973.6%
 

Count_MF
Real number (ℝ≥0)

ZEROS

Distinct count30
Unique (%)1.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.8860681114551083
Minimum0.0
Maximum79.0
Zeros1309
Zeros (%)81.1%
Memory size12.6 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile4
Maximum79
Range79
Interquartile range (IQR)0

Descriptive statistics

Standard deviation3.871786274
Coefficient of variation (CV)4.369626019
Kurtosis169.3170164
Mean0.8860681115
Median Absolute Deviation (MAD)0
Skewness11.03108251
Sum1431
Variance14.99072895
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0130981.1%
 
11026.3%
 
2533.3%
 
3422.6%
 
4311.9%
 
6140.9%
 
5140.9%
 
7100.6%
 
1160.4%
 
850.3%
 
Other values (20)291.8%
 
ValueCountFrequency (%) 
0130981.1%
 
11026.3%
 
2533.3%
 
3422.6%
 
4311.9%
 
ValueCountFrequency (%) 
7910.1%
 
6410.1%
 
4510.1%
 
3510.1%
 
3210.1%
 

Count_OVD
Boolean

Distinct count2
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size12.6 KiB
0
1196
1
419
ValueCountFrequency (%) 
0119674.1%
 
141925.9%
 

Count_CC
Boolean

Distinct count2
Unique (%)0.1%
Missing0
Missing (%)0.0%
Memory size12.6 KiB
0
1445
1
 
170
ValueCountFrequency (%) 
0144589.5%
 
117010.5%
 

Count_CL
Real number (ℝ≥0)

ZEROS

Distinct count5
Unique (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.09907120743034056
Minimum0.0
Maximum5.0
Zeros1480
Zeros (%)91.6%
Memory size12.6 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile1
Maximum5
Range5
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.3608373564
Coefficient of variation (CV)3.642202066
Kurtosis34.28049764
Mean0.09907120743
Median Absolute Deviation (MAD)0
Skewness4.832215521
Sum160
Variance0.1302035978
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0148091.6%
 
11157.1%
 
2171.1%
 
320.1%
 
510.1%
 
ValueCountFrequency (%) 
0148091.6%
 
11157.1%
 
2171.1%
 
320.1%
 
510.1%
 
ValueCountFrequency (%) 
510.1%
 
320.1%
 
2171.1%
 
11157.1%
 
0148091.6%
 

ActBal_CA
Real number (ℝ≥0)

ZEROS

Distinct count1514
Unique (%)93.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2438.601940513048
Minimum0.0
Maximum171575.88964285716
Zeros94
Zeros (%)5.8%
Memory size12.6 KiB

Quantile statistics

Minimum0
5-th percentile0
Q161.56214286
median462.2217857
Q32174.864286
95-th percentile10632.11825
Maximum171575.8896
Range171575.8896
Interquartile range (IQR)2113.302143

Descriptive statistics

Standard deviation7072.77735
Coefficient of variation (CV)2.900341065
Kurtosis221.4121637
Mean2438.601941
Median Absolute Deviation (MAD)456.1639286
Skewness11.52805078
Sum3938342.134
Variance50024179.44
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0945.8%
 
28.5714285730.2%
 
7.14285714320.1%
 
35.7142857120.1%
 
0.388214285720.1%
 
0.000357142857120.1%
 
178.571428620.1%
 
142.857142920.1%
 
4.5510.1%
 
379.348571410.1%
 
Other values (1504)150493.1%
 
ValueCountFrequency (%) 
0945.8%
 
0.000357142857120.1%
 
0.00142857142910.1%
 
0.00214285714310.1%
 
0.0103571428610.1%
 
ValueCountFrequency (%) 
171575.889610.1%
 
74934.9560710.1%
 
60226.1417910.1%
 
55934.0203610.1%
 
55383.7978610.1%
 

ActBal_SA
Real number (ℝ≥0)

ZEROS

Distinct count411
Unique (%)25.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4009.8127808491818
Minimum0.0
Maximum389883.8307142857
Zeros1205
Zeros (%)74.6%
Memory size12.6 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30.01
95-th percentile20632.69264
Maximum389883.8307
Range389883.8307
Interquartile range (IQR)0.01

Descriptive statistics

Standard deviation17909.06155
Coefficient of variation (CV)4.466308659
Kurtosis166.5740119
Mean4009.812781
Median Absolute Deviation (MAD)0
Skewness10.5017382
Sum6475847.641
Variance320734485.4
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0120574.6%
 
0.481071428610.1%
 
5352.29428610.1%
 
16144.0021410.1%
 
5069.31571410.1%
 
5384.08642910.1%
 
3930.25510.1%
 
35768.3885710.1%
 
24364.76510.1%
 
7139.762510.1%
 
Other values (401)40124.8%
 
ValueCountFrequency (%) 
0120574.6%
 
0.000714285714310.1%
 
0.00142857142910.1%
 
0.00178571428610.1%
 
0.00714285714310.1%
 
ValueCountFrequency (%) 
389883.830710.1%
 
219368.646410.1%
 
191591.189610.1%
 
173499.862910.1%
 
128617.591110.1%
 

ActBal_MF
Real number (ℝ≥0)

ZEROS

Distinct count190
Unique (%)11.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3887.5326592215833
Minimum0.0
Maximum761235.5042857144
Zeros1420
Zeros (%)87.9%
Memory size12.6 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile10443.76093
Maximum761235.5043
Range761235.5043
Interquartile range (IQR)0

Descriptive statistics

Standard deviation34868.01017
Coefficient of variation (CV)8.969187715
Kurtosis311.2991156
Mean3887.532659
Median Absolute Deviation (MAD)0
Skewness16.60253721
Sum6278365.245
Variance1215778133
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0142087.9%
 
14285.7142930.2%
 
308.882142930.2%
 
23214.2857120.1%
 
169.674642920.1%
 
3020.64071410.1%
 
62103.2685710.1%
 
11342.5457110.1%
 
4107.86035710.1%
 
5560.71428610.1%
 
Other values (180)18011.1%
 
ValueCountFrequency (%) 
0142087.9%
 
17.7285714310.1%
 
107.251785710.1%
 
113.216785710.1%
 
125.125714310.1%
 
ValueCountFrequency (%) 
761235.504310.1%
 
714285.714310.1%
 
579084.107910.1%
 
446183.423210.1%
 
314723.16510.1%
 

ActBal_OVD
Real number (ℝ≥0)

ZEROS

Distinct count121
Unique (%)7.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean32.80650574966829
Minimum0.0
Maximum2055.325357142857
Zeros1495
Zeros (%)92.6%
Memory size12.6 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile199.057
Maximum2055.325357
Range2055.325357
Interquartile range (IQR)0

Descriptive statistics

Standard deviation157.9264288
Coefficient of variation (CV)4.813875333
Kurtosis55.8318783
Mean32.80650575
Median Absolute Deviation (MAD)0
Skewness6.744763711
Sum52982.50679
Variance24940.75691
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0149592.6%
 
901.616071410.1%
 
104.011428610.1%
 
246.517857110.1%
 
676.752857110.1%
 
531.629642910.1%
 
31.557510.1%
 
40.2571428610.1%
 
874.1810.1%
 
248.484642910.1%
 
Other values (111)1116.9%
 
ValueCountFrequency (%) 
0149592.6%
 
0.694642857110.1%
 
8.44510.1%
 
8.86892857110.1%
 
12.3410.1%
 
ValueCountFrequency (%) 
2055.32535710.1%
 
1776.06535710.1%
 
1670.28071410.1%
 
1568.87964310.1%
 
1459.45714310.1%
 

ActBal_CC
Real number (ℝ)

ZEROS

Distinct count121
Unique (%)7.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean36.97846528084918
Minimum-15.479285714285714
Maximum3522.233571428572
Zeros1461
Zeros (%)90.5%
Memory size12.6 KiB

Quantile statistics

Minimum-15.47928571
5-th percentile0
Q10
median0
Q30
95-th percentile213.5798929
Maximum3522.233571
Range3537.712857
Interquartile range (IQR)0

Descriptive statistics

Standard deviation190.8695924
Coefficient of variation (CV)5.161641809
Kurtosis99.62910469
Mean36.97846528
Median Absolute Deviation (MAD)0
Skewness8.255315915
Sum59720.22143
Variance36431.20132
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0146190.5%
 
0.7142857143110.7%
 
2.357142857110.7%
 
1.17857142970.4%
 
1.42857142960.4%
 
0.464285714330.2%
 
1.78571428620.1%
 
212.115357110.1%
 
627.857142910.1%
 
-15.4792857110.1%
 
Other values (111)1116.9%
 
ValueCountFrequency (%) 
-15.4792857110.1%
 
-8.28321428610.1%
 
-7.73821428610.1%
 
-4.46428571410.1%
 
-1.42857142910.1%
 
ValueCountFrequency (%) 
3522.23357110.1%
 
2382.94607110.1%
 
1767.78535710.1%
 
1529.05892910.1%
 
1400.91857110.1%
 

ActBal_CL
Real number (ℝ≥0)

ZEROS

Distinct count135
Unique (%)8.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean354.23013467492257
Minimum0.0
Maximum20749.29464285714
Zeros1481
Zeros (%)91.7%
Memory size12.6 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile2198.358179
Maximum20749.29464
Range20749.29464
Interquartile range (IQR)0

Descriptive statistics

Standard deviation1678.743095
Coefficient of variation (CV)4.739131234
Kurtosis49.45970124
Mean354.2301347
Median Absolute Deviation (MAD)0
Skewness6.484250485
Sum572081.6675
Variance2818178.38
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0148191.7%
 
4250.11678610.1%
 
117.653928610.1%
 
1621.42178610.1%
 
2953.812510.1%
 
1038.69678610.1%
 
360.354642910.1%
 
7570.78392910.1%
 
930.357142910.1%
 
9115.11285710.1%
 
Other values (125)1257.7%
 
ValueCountFrequency (%) 
0148191.7%
 
63.8032142910.1%
 
117.653928610.1%
 
120.676071410.1%
 
194.803928610.1%
 
ValueCountFrequency (%) 
20749.2946410.1%
 
17300.4896410.1%
 
17062.3242910.1%
 
14708.5714310.1%
 
14516.5242910.1%
 

VolumeCred
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct count1490
Unique (%)92.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1791.9436782397167
Minimum0.0
Maximum107703.80428571427
Zeros42
Zeros (%)2.6%
Memory size12.6 KiB

Quantile statistics

Minimum0
5-th percentile0.001071428571
Q1214.2860714
median639.4228571
Q31406.373214
95-th percentile5960.831929
Maximum107703.8043
Range107703.8043
Interquartile range (IQR)1192.087143

Descriptive statistics

Standard deviation5818.571603
Coefficient of variation (CV)3.247072814
Kurtosis165.0604121
Mean1791.943678
Median Absolute Deviation (MAD)532.2796429
Skewness11.24170399
Sum2893989.04
Variance33855775.49
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0422.6%
 
0.0003571428571251.5%
 
0.00107142857190.6%
 
0.000714285714370.4%
 
0.00714285714360.4%
 
0.00178571428650.3%
 
107.143214340.2%
 
0.00142857142940.2%
 
0.00214285714330.2%
 
0.00964285714330.2%
 
Other values (1480)150793.3%
 
ValueCountFrequency (%) 
0422.6%
 
0.0003571428571251.5%
 
0.000714285714370.4%
 
0.00107142857190.6%
 
0.00142857142940.2%
 
ValueCountFrequency (%) 
107703.804310.1%
 
98717.67510.1%
 
90124.6096410.1%
 
68371.5421410.1%
 
46756.5407110.1%
 

VolumeCred_CA
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct count1432
Unique (%)88.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1480.2131904024766
Minimum0.0
Maximum98717.675
Zeros58
Zeros (%)3.6%
Memory size12.6 KiB

Quantile statistics

Minimum0
5-th percentile0.0003571428571
Q1192.0483929
median601.4339286
Q31288.683929
95-th percentile4498.145286
Maximum98717.675
Range98717.675
Interquartile range (IQR)1096.635536

Descriptive statistics

Standard deviation4625.10769
Coefficient of variation (CV)3.124622669
Kurtosis192.1602766
Mean1480.21319
Median Absolute Deviation (MAD)494.2875
Skewness11.98514851
Sum2390544.302
Variance21391621.15
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0583.6%
 
0.0003571428571342.1%
 
0.001071428571130.8%
 
0.000714285714380.5%
 
0.00714285714370.4%
 
0.00178571428670.4%
 
0.00285714285760.4%
 
0.007550.3%
 
107.143214340.2%
 
0.00321428571440.2%
 
Other values (1422)146991.0%
 
ValueCountFrequency (%) 
0583.6%
 
0.0003571428571342.1%
 
0.000714285714380.5%
 
0.001071428571130.8%
 
0.00142857142930.2%
 
ValueCountFrequency (%) 
98717.67510.1%
 
68667.552510.1%
 
66908.9510710.1%
 
54128.3792910.1%
 
46714.2921410.1%
 

TransactionsCred
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct count45
Unique (%)2.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.445820433436532
Minimum0.0
Maximum63.0
Zeros42
Zeros (%)2.6%
Memory size12.6 KiB

Quantile statistics

Minimum0
5-th percentile1
Q12
median3
Q36
95-th percentile19
Maximum63
Range63
Interquartile range (IQR)4

Descriptive statistics

Standard deviation6.341432047
Coefficient of variation (CV)1.164458528
Kurtosis14.43840609
Mean5.445820433
Median Absolute Deviation (MAD)1
Skewness3.273911883
Sum8795
Variance40.2137604
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
235722.1%
 
328717.8%
 
419612.1%
 
11549.5%
 
51287.9%
 
6945.8%
 
7603.7%
 
8462.8%
 
0422.6%
 
9362.2%
 
Other values (35)21513.3%
 
ValueCountFrequency (%) 
0422.6%
 
11549.5%
 
235722.1%
 
328717.8%
 
419612.1%
 
ValueCountFrequency (%) 
6310.1%
 
5610.1%
 
4410.1%
 
4310.1%
 
4210.1%
 

TransactionsCred_CA
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct count36
Unique (%)2.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.1913312693498455
Minimum0.0
Maximum48.0
Zeros58
Zeros (%)3.6%
Memory size12.6 KiB

Quantile statistics

Minimum0
5-th percentile1
Q12
median3
Q34
95-th percentile14
Maximum48
Range48
Interquartile range (IQR)2

Descriptive statistics

Standard deviation4.93249842
Coefficient of variation (CV)1.17683335
Kurtosis15.75007973
Mean4.191331269
Median Absolute Deviation (MAD)1
Skewness3.553063444
Sum6769
Variance24.32954067
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
246728.9%
 
330719.0%
 
120312.6%
 
418611.5%
 
51217.5%
 
6593.7%
 
0583.6%
 
8322.0%
 
7301.9%
 
10211.3%
 
Other values (26)1318.1%
 
ValueCountFrequency (%) 
0583.6%
 
120312.6%
 
246728.9%
 
330719.0%
 
418611.5%
 
ValueCountFrequency (%) 
4810.1%
 
3710.1%
 
3610.1%
 
3510.1%
 
3330.2%
 

VolumeDeb
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct count1439
Unique (%)89.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1667.793195488722
Minimum0.0
Maximum119906.50392857144
Zeros94
Zeros (%)5.8%
Memory size12.6 KiB

Quantile statistics

Minimum0
5-th percentile0
Q1201.8528571
median641.8660714
Q31387.913393
95-th percentile5551.514607
Maximum119906.5039
Range119906.5039
Interquartile range (IQR)1186.060536

Descriptive statistics

Standard deviation5143.402322
Coefficient of variation (CV)3.083956893
Kurtosis209.9109184
Mean1667.793195
Median Absolute Deviation (MAD)533.8803571
Skewness11.94464244
Sum2693486.011
Variance26454587.44
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0945.8%
 
1.964285714352.2%
 
3.392857143140.9%
 
0.8928571429140.9%
 
0.893928571490.6%
 
14.2857142930.2%
 
17.8571428620.1%
 
53.5714285720.1%
 
1.42857142920.1%
 
35.7142857120.1%
 
Other values (1429)143889.0%
 
ValueCountFrequency (%) 
0945.8%
 
0.00510.1%
 
0.00892857142910.1%
 
0.0285714285710.1%
 
0.288928571410.1%
 
ValueCountFrequency (%) 
119906.503910.1%
 
64608.3614310.1%
 
59500.932510.1%
 
44782.1860710.1%
 
44078.1839310.1%
 

VolumeDeb_CA
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct count1425
Unique (%)88.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1434.8866919504646
Minimum0.0
Maximum73477.93250000001
Zeros100
Zeros (%)6.2%
Memory size12.6 KiB

Quantile statistics

Minimum0
5-th percentile0
Q1188.5982143
median607.8214286
Q31309.140893
95-th percentile4209.422429
Maximum73477.9325
Range73477.9325
Interquartile range (IQR)1120.542679

Descriptive statistics

Standard deviation4248.350539
Coefficient of variation (CV)2.960756806
Kurtosis127.4040695
Mean1434.886692
Median Absolute Deviation (MAD)504.25
Skewness10.03800148
Sum2317342.008
Variance18048482.3
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
01006.2%
 
1.964285714362.2%
 
3.392857143150.9%
 
0.8928571429150.9%
 
0.8939285714100.6%
 
14.2857142940.2%
 
67.8571428630.2%
 
714.285714320.1%
 
21.4285714320.1%
 
5.53571428620.1%
 
Other values (1415)142688.3%
 
ValueCountFrequency (%) 
01006.2%
 
0.000357142857110.1%
 
0.00510.1%
 
0.714285714310.1%
 
0.892142857110.1%
 
ValueCountFrequency (%) 
73477.932510.1%
 
64178.1853610.1%
 
59500.932510.1%
 
44782.1860710.1%
 
44078.1839310.1%
 

VolumeDebCash_Card
Real number (ℝ≥0)

SKEWED
ZEROS

Distinct count261
Unique (%)16.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean253.46535869084477
Minimum0.0
Maximum23571.42857142857
Zeros659
Zeros (%)40.8%
Memory size12.6 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median71.42857143
Q3342.8571429
95-th percentile857.1428571
Maximum23571.42857
Range23571.42857
Interquartile range (IQR)342.8571429

Descriptive statistics

Standard deviation751.8874198
Coefficient of variation (CV)2.966430694
Kurtosis592.484666
Mean253.4653587
Median Absolute Deviation (MAD)71.42857143
Skewness20.57539539
Sum409346.5543
Variance565334.692
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
065940.8%
 
178.5714286593.7%
 
357.1428571483.0%
 
71.42857143291.8%
 
535.7142857271.7%
 
35.71428571261.6%
 
107.1428571261.6%
 
285.7142857251.5%
 
53.57142857241.5%
 
214.2857143221.4%
 
Other values (251)67041.5%
 
ValueCountFrequency (%) 
065940.8%
 
3.57142857120.1%
 
7.14285714350.3%
 
10.7142857170.4%
 
14.28571429100.6%
 
ValueCountFrequency (%) 
23571.4285710.1%
 
9714.28571410.1%
 
6428.57142910.1%
 
4513.75892910.1%
 
3535.71428610.1%
 

VolumeDebCashless_Card
Real number (ℝ≥0)

ZEROS

Distinct count874
Unique (%)54.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean148.23503825740823
Minimum0.0
Maximum3637.616785714286
Zeros733
Zeros (%)45.4%
Memory size12.6 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median16.48214286
Q3174.1696429
95-th percentile670.8960714
Maximum3637.616786
Range3637.616786
Interquartile range (IQR)174.1696429

Descriptive statistics

Standard deviation309.8478018
Coefficient of variation (CV)2.090246716
Kurtosis39.79399338
Mean148.2350383
Median Absolute Deviation (MAD)16.48214286
Skewness5.073639949
Sum239399.5868
Variance96005.66031
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
073345.4%
 
10.7142857140.2%
 
14.2857142930.2%
 
46.3928571420.1%
 
32.1428571420.1%
 
7.10714285720.1%
 
155.642857120.1%
 
31.75510.1%
 
64.5035714310.1%
 
174.046428610.1%
 
Other values (864)86453.5%
 
ValueCountFrequency (%) 
073345.4%
 
0.890357142910.1%
 
1.06785714310.1%
 
1.78928571410.1%
 
2.0510.1%
 
ValueCountFrequency (%) 
3637.61678610.1%
 
3635.70142910.1%
 
3335.42607110.1%
 
3292.76510.1%
 
2470.33571410.1%
 

VolumeDeb_PaymentOrder
Real number (ℝ≥0)

ZEROS

Distinct count1066
Unique (%)66.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean703.5617430340557
Minimum0.0
Maximum72278.78214285713
Zeros463
Zeros (%)28.7%
Memory size12.6 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median170.7142857
Q3495.2880357
95-th percentile2160.450286
Maximum72278.78214
Range72278.78214
Interquartile range (IQR)495.2880357

Descriptive statistics

Standard deviation3188.467264
Coefficient of variation (CV)4.531894031
Kurtosis263.2064008
Mean703.561743
Median Absolute Deviation (MAD)170.7142857
Skewness14.32428222
Sum1136252.215
Variance10166323.49
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
046328.7%
 
35.71428571110.7%
 
17.85714286100.6%
 
10.7142857170.4%
 
71.4285714370.4%
 
357.142857150.3%
 
107.142857150.3%
 
21.4285714340.2%
 
132.142857140.2%
 
53.5714285740.2%
 
Other values (1056)109567.8%
 
ValueCountFrequency (%) 
046328.7%
 
0.00510.1%
 
0.0357142857110.1%
 
0.714285714310.1%
 
1.07142857110.1%
 
ValueCountFrequency (%) 
72278.7821410.1%
 
61955.1039310.1%
 
35714.2857110.1%
 
30069.6964310.1%
 
27129.2142910.1%
 

TransactionsDeb
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct count86
Unique (%)5.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean15.729411764705882
Minimum0.0
Maximum102.0
Zeros94
Zeros (%)5.8%
Memory size12.6 KiB

Quantile statistics

Minimum0
5-th percentile0
Q14
median11
Q322
95-th percentile48
Maximum102
Range102
Interquartile range (IQR)18

Descriptive statistics

Standard deviation16.23710532
Coefficient of variation (CV)1.032276703
Kurtosis4.084567854
Mean15.72941176
Median Absolute Deviation (MAD)8
Skewness1.817641998
Sum25403
Variance263.6435892
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
11076.6%
 
0945.8%
 
3865.3%
 
2835.1%
 
6744.6%
 
4704.3%
 
5674.1%
 
7633.9%
 
9553.4%
 
8523.2%
 
Other values (76)86453.5%
 
ValueCountFrequency (%) 
0945.8%
 
11076.6%
 
2835.1%
 
3865.3%
 
4704.3%
 
ValueCountFrequency (%) 
10210.1%
 
9910.1%
 
9620.1%
 
9510.1%
 
9210.1%
 

TransactionsDeb_CA
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct count71
Unique (%)4.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13.360990712074303
Minimum0.0
Maximum83.0
Zeros100
Zeros (%)6.2%
Memory size12.6 KiB

Quantile statistics

Minimum0
5-th percentile0
Q14
median10
Q319
95-th percentile39
Maximum83
Range83
Interquartile range (IQR)15

Descriptive statistics

Standard deviation12.98418021
Coefficient of variation (CV)0.9717977127
Kurtosis3.712271308
Mean13.36099071
Median Absolute Deviation (MAD)7
Skewness1.693030528
Sum21578
Variance168.5889358
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
11187.3%
 
01006.2%
 
3905.6%
 
2865.3%
 
5855.3%
 
4774.8%
 
6684.2%
 
9603.7%
 
7583.6%
 
11533.3%
 
Other values (61)82050.8%
 
ValueCountFrequency (%) 
01006.2%
 
11187.3%
 
2865.3%
 
3905.6%
 
4774.8%
 
ValueCountFrequency (%) 
8310.1%
 
7910.1%
 
7520.1%
 
7410.1%
 
7210.1%
 

TransactionsDebCash_Card
Real number (ℝ≥0)

ZEROS

Distinct count19
Unique (%)1.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.9541795665634676
Minimum0.0
Maximum25.0
Zeros659
Zeros (%)40.8%
Memory size12.6 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q33
95-th percentile7
Maximum25
Range25
Interquartile range (IQR)3

Descriptive statistics

Standard deviation2.699604047
Coefficient of variation (CV)1.381451374
Kurtosis8.62127873
Mean1.954179567
Median Absolute Deviation (MAD)1
Skewness2.366542932
Sum3156
Variance7.287862012
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
065940.8%
 
128917.9%
 
220212.5%
 
31398.6%
 
41116.9%
 
6623.8%
 
5503.1%
 
7352.2%
 
8191.2%
 
12140.9%
 
Other values (9)352.2%
 
ValueCountFrequency (%) 
065940.8%
 
128917.9%
 
220212.5%
 
31398.6%
 
41116.9%
 
ValueCountFrequency (%) 
2510.1%
 
2110.1%
 
1710.1%
 
1630.2%
 
1510.1%
 

TransactionsDebCashless_Card
Real number (ℝ≥0)

ZEROS

Distinct count47
Unique (%)2.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.149226006191951
Minimum0.0
Maximum60.0
Zeros733
Zeros (%)45.4%
Memory size12.6 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median1
Q37
95-th percentile24
Maximum60
Range60
Interquartile range (IQR)7

Descriptive statistics

Standard deviation8.341199628
Coefficient of variation (CV)1.619893867
Kurtosis6.392143482
Mean5.149226006
Median Absolute Deviation (MAD)1
Skewness2.338394148
Sum8316
Variance69.57561123
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
073345.4%
 
11257.7%
 
2986.1%
 
3674.1%
 
4603.7%
 
6573.5%
 
9472.9%
 
7472.9%
 
8432.7%
 
5352.2%
 
Other values (37)30318.8%
 
ValueCountFrequency (%) 
073345.4%
 
11257.7%
 
2986.1%
 
3674.1%
 
4603.7%
 
ValueCountFrequency (%) 
6010.1%
 
5910.1%
 
4810.1%
 
4720.1%
 
4610.1%
 

TransactionsDeb_PaymentOrder
Real number (ℝ≥0)

ZEROS

Distinct count30
Unique (%)1.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4.521362229102167
Minimum0.0
Maximum34.0
Zeros463
Zeros (%)28.7%
Memory size12.6 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median3
Q37
95-th percentile15
Maximum34
Range34
Interquartile range (IQR)7

Descriptive statistics

Standard deviation5.200860965
Coefficient of variation (CV)1.150286286
Kurtosis3.02599718
Mean4.521362229
Median Absolute Deviation (MAD)3
Skewness1.579147324
Sum7302
Variance27.04895477
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
046328.7%
 
116510.2%
 
21388.5%
 
31227.6%
 
51086.7%
 
41036.4%
 
6784.8%
 
8633.9%
 
7603.7%
 
10523.2%
 
Other values (20)26316.3%
 
ValueCountFrequency (%) 
046328.7%
 
116510.2%
 
21388.5%
 
31227.6%
 
41036.4%
 
ValueCountFrequency (%) 
3410.1%
 
3310.1%
 
3120.1%
 
2610.1%
 
2520.1%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

Sample

First rows

ClientSexAgeTenureCount_CACount_SACount_MFCount_OVDCount_CCCount_CLActBal_CAActBal_SAActBal_MFActBal_OVDActBal_CCActBal_CLVolumeCredVolumeCred_CATransactionsCredTransactionsCred_CAVolumeDebVolumeDeb_CAVolumeDebCash_CardVolumeDebCashless_CardVolumeDeb_PaymentOrderTransactionsDebTransactionsDeb_CATransactionsDebCash_CardTransactionsDebCashless_CardTransactionsDeb_PaymentOrder
09090212710.00.01.00.01.04.7107140.0000000.0000000.00000.0000004291.996429789.129643738.2300004.03.0450.678571448.892857178.5714290.000000166.5714298.07.01.00.04.0
1121703816510.00.00.00.00.06752.2446430.0000000.0000000.00000.0000000.0000000.0021430.0021431.01.0714.285714714.2857140.0000000.000000714.2857141.01.00.00.01.0
28501494410.00.00.00.00.043.5232140.0000000.0000000.00000.0000000.0000001392.4028571392.4028573.03.01226.3453571226.3453570.0000000.000000121.9285716.06.00.00.01.0
314730543411.00.00.01.01.029.02428614447.8014290.0000000.0000653.9100001132.5903571787.127500939.12892914.05.03875.1378573794.580714357.142857444.5975002076.78571448.038.01.026.011.0
4103802910610.00.00.00.00.027.0357140.0000000.0000000.00000.0000000.0000000.0060710.0060711.01.00.0000000.0000000.0000000.0000000.0000000.00.00.00.00.0
522501418710.00.01.00.00.0345.6860710.0000000.000000618.39750.0000000.0000000.0075000.0075001.01.0130.521429130.5214290.000000111.23571419.2857145.05.00.04.01.0
669903717510.04.01.00.00.01823.0571430.00000018491.4442860.00000.0000000.0000001033.496071778.3700008.06.0661.483214566.12607189.2857140.000000216.89285713.010.02.00.05.0
78270575010.00.01.01.00.049.1935710.0000000.0000000.00000.4642860.0000001755.2817861750.4042869.06.01474.3214291455.035714607.14285717.857143843.25000026.023.04.01.017.0
812311623210.00.00.00.00.0819.8539290.0000000.0000000.00000.0000000.0000000.0157140.0157142.02.02257.0000002257.0000002250.0000000.0000000.0000009.09.06.00.00.0
95281197010.00.01.00.00.00.0000000.0000000.0000000.00000.0000000.000000435.682143435.6821432.02.0390.056429390.056429125.00000070.842143190.82142910.010.04.03.02.0

Last rows

ClientSexAgeTenureCount_CACount_SACount_MFCount_OVDCount_CCCount_CLActBal_CAActBal_SAActBal_MFActBal_OVDActBal_CCActBal_CLVolumeCredVolumeCred_CATransactionsCredTransactionsCred_CAVolumeDebVolumeDeb_CAVolumeDebCash_CardVolumeDebCashless_CardVolumeDeb_PaymentOrderTransactionsDebTransactionsDeb_CATransactionsDebCash_CardTransactionsDebCashless_CardTransactionsDeb_PaymentOrder
160583303615010.00.00.00.00.03017.6414290.0000000.0000000.0000000.00.0000000.0000000.0000000.00.00.0000000.0000000.0000000.0000000.0000000.00.00.00.00.0
16065731567710.00.00.00.00.01243.5057140.0000000.0000000.0000000.00.000000760.966429760.9664293.03.0713.569643713.569643357.142857114.962500228.25000014.014.03.04.04.0
160762113715211.00.00.00.00.01249.7096430.0014290.0000000.0000000.00.0000007.3978570.0025003.01.00.2889290.0000000.0000000.0000000.0000001.00.00.00.00.0
160846406815320.00.01.00.00.02249.0496430.0000000.000000237.2971430.00.000000811.332857811.3328573.03.0577.385714577.385714160.71428665.421429347.85714312.012.02.03.06.0
1609127602814510.00.00.00.02.013915.9253570.0000000.0000000.0000000.01764.589286610.893214514.3132144.03.0165.928571155.10714335.7142860.0000000.0000006.04.01.00.00.0
16104090319110.00.01.00.00.0348.4028570.0000000.0000000.0000000.00.000000469.179643469.1796433.03.0465.092857465.092857178.57142911.414286271.71428612.012.01.01.09.0
161138402316010.00.00.00.00.02418.8767860.0000000.0000000.0000000.00.00000087.50035787.5003572.02.088.44392988.44392950.00000037.3725001.0714298.08.02.05.01.0
16129770465910.00.00.00.00.02639.3085710.0000000.0000000.0000000.00.00000071.42857171.4285711.01.076.10357176.10357175.0000001.0678570.0357144.04.02.01.01.0
161362916117310.02.00.00.00.061.7667860.00000034387.5835710.0000000.00.0000001064.1900001064.1900003.03.0817.462143817.462143660.71428641.355000115.17857117.017.03.05.08.0
161414660639710.00.01.00.00.021.6275000.0000000.0000000.0000000.00.000000742.597143742.5971434.04.0624.428571624.42857171.42857123.214286526.3928576.06.01.01.03.0